Widder et al. BMC Systems Biology 2012, 6:7 
http://www.biomedcentral.eom/1752-0509/6/7 

Systems Biology 



RESEARCH ARTICLE Open Access 



Evolvability of feed-forward loop architecture 
biases its abundance in transcription networks 

Stefanie Widder 1,2 ", Ricard Sole 2,3 and Javier Macia 2 




Abstract 

Background: Transcription networks define the core of the regulatory machinery of cellular life and are largely 
responsible for information processing and decision making. At the small scale, interaction motifs have been 
characterized based on their abundance and some seemingly general patterns have been described. In particular, 
the abundance of different feed-forward loop motifs in gene regulatory networks displays systematic biases 
towards some particular topologies, which are much more common than others. The causative process of this 
pattern is still matter of debate. 

Results: We analyzed the entire motif-function landscape of the feed-forward loop using the formalism developed 
in a previous work. We evaluated the probabilities to implement possible functions for each motif and found that 
the kurtosis of these distributions correlate well with the natural abundance pattern. Kurtosis is a standard measure 
for the peakedness of probability distributions. Furthermore, we examined the functional robustness of the motifs 
facing mutational pressure in silico and observed that the abundance pattern is biased by the degree of their 
evolvability. 

Conclusions: The natural abundance pattern of the feed-forward loop can be reconstructed concerning its 
intrinsic plasticity. Intrinsic plasticity is associated to each motif in terms of its capacity of implementing a 
repertoire of possible functions and it is directly linked to the motif's evolvability. Since evolvability is defined as 
the potential phenotypic variation of the motif upon mutation, the link plausibly explains the abundance pattern. 



Background 

Evolutionary adaptability in biological systems is often 
the result of trade-offs between flexibility and specializa- 
tion [1]. In this context, buffering mutations and noise 
seem an important requirement for stability. This can 
be achieved by a robust response to parameter changes 
and correlates with the degree of specialization of the 
given structure. A given network insensitive to muta- 
tions will always perform the same function. On the 
other hand, adaptation and evolvability requires flexible 
structures that can be re-used to perform different 
(potential) functions and thus provide plasticity [2,3]. 
The problem here is often understanding why some par- 
ticular structures are so common and what their (if any) 
functional meaning is. This is closely tied to the map- 
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ping /between structure S and function F, namely the 
relationship: 



(1) 



which is usually dubbed as the genotype-phenotype 
mapping problem [4]. Understanding the nature and 
origins of this mapping is at the core of many key ques- 
tions concerning the evolution of complexity in nature. 

Within the context of gene transcription networks, it 
has been suggested that the previous problem can be 
dissected by analyzing the frequency of some overabun- 
dant sub-networks of three or four elements, so called 
network motifs [5-7]. These sub-graphs only capture the 
topological pattern of connections and a dynamical 
description of their potential function requires a set of 
differential equations [8,9]. One particularly important 
example is provided by feed-forward loop (FFL) motifs 
[5]. Many genetic and biochemical systems, such as the 
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Lac and Che systems in E. coli (responsible for lactose 
utilization and chemotaxis, respectively) involve FFL 
motifs [10-12]. Mounting evidence indicates that they 
have key roles in cell function [10] and morphogenesis 
[13,14]. 

However, the origin of a preferential bias towards 
given topologies remains under discussion. 

The relative frequency of FFLs displays a well-defined 
pattern (figure lc) dominated by two sub-graphs (CI 
and II). The uneven abundance of these graphs could 
be a fingerprint of their functional relevance [8,15-17]. 
Such importance would be the blueprint of an evolu- 
tionary advantage, but it is not clear whether such func- 
tional connection really exists [15,18-21] or if it 
resembles instead a byproduct of non-adaptive processes 
[22-25]. As shown below, motif structure does not 
directly relate to its frequency, but its plasticity in 
implementing different functions does. 
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Figure 1 Structure and frequency of FFL motifs. In (a) we show 
the schematic representation of the FFL's genetic regulatory 
interactions ('+' represents activatory regulation and '-' represents 
inhibitory regulation). The external input / activates the signal 
protein X. Active X modulates expression of gene G z directly and 
indirectly via regulation of Y expression, which in turn also regulates 
G z . The dynamics of these regulatory interactions is described by a 
set of equations dy/dt = F{y z), dz/dt = G{y z) incorporating the 
nonlinearities associated to gene-gene interactions. In (b) we plot 
the general topology of FFL motifs and the six different functions q> 
(f) represented by qualitative time-courses [Z(f)]. 'G' indicates grader 
dynamics, 'P' pulser dynamics. We specifically take into account the 
initial slope of the time-course ('+' or '-') and the concentration of 
the final target Z with respect to the non-induced protein 
concentration (T+' and T-'). In (c) we display the relative abundance 
PobsFD of these motifs in the transcription networks of yeast and £ 
coli (data from [27]). 



Results and Discussion 

Probability distribution of implementing different 
functions 

Consider the FFL graphs I\ from the set 
S = {Ci, C2, C3, C4,h,l2,hfh} shown in figure la. In 
previous studies [9,26] it has been shown that, the 
topology of a given FFL does not univocally define its 
function but it captures the probability distribution of 
implementing different functions. Our first goal is to 
identify an appropriate mapping / between FFL topology 
and each potential response (pj(t), i.e.fj : x(0), ft} — » 
(Pj(t) where we indicate as {T if #(0),//} the FFL graph 
together with its initial condition #(0) and the set of 
parameters used ft [26]. The six different responses (fig- 
ure lb) are triggered by an external input. These are 
either fast response (pulser) or delayed response (grader) 
considering the target concentration of the output. Here 
fij indicates the likelihood that (p ; (t) is implemented by 

r,. 

How likely is a motif to become part of a complex cel- 
lular network? Two extreme strategies can be envi- 
sioned. In the first, specific motifs play specific roles in 
a robust way and they are common because they are 
insensitive to mutational noise. In the second, the larger 
the variety of implementable functions, the more flexible 
the better. Such a scenario is feasible under the premise 
of ever-changing environments and comes with the cost 
of reduced robustness. In order to measure the plasticity 
of decision-making between these two strategies, let us 
first determine the (conditional) probability^ = P((pj\Ti). 
These probabilities are normalized, i. e. 

= 1 and can be systematically computed 
[26]. This set actually defines our structure-function 
map, namely 



(2) 



and can be displayed (figure 2a) as a weighted, motif- 
function bipartite graph (see Methods). The graph 
reveals that most motifs implement all functions, but 
the likelihood of each pair is case-dependent. Some 
motifs seem clearly more specialized (such as C4) 
whereas others are rather generalists (see for example 
74). What influences the choice of a given topology over 
others? Since most of the functions can be implemented, 
it is not clear that a one-to-one, function-based argu- 
ment will work. But we can go a step beyond and look 
at the structure of the probability distribution {P((pj\Ti)} 
of each topology. This can be done by measuring the 
degree of homogeneity displayed, for each T b by plotting 
P((pj\Ti). If the motif is highly specialized, some domi- 
nant peak(s) would be observed, whereas if it is very 
flexible no prominent peaks will appear. A simple way 



Widder et al. BMC Systems Biology 2012, 6:7 
http://www.biomedcentral.eom/1752-0509/6/7 



Page 3 of 9 




'1 ■□ 

I3 L_.__. 
"■UU_J . I _ 0 
99 77 7? 

H 7 1 H + 



Figure 2 FFL function and their probabilities, (a) The landscape 
of FFL motifs is displayed as a bipartite graph linking patterns 
(upper row) and processes (lower). The weight of the links indicates 
the relative probability P{<pj | Tj) that a given motif r,- implements a 
given function q)j. In (b) the matrix of motif-function probabilities is 
displayed using a color scale. The plots highlight that some motifs 
look more specialized, whereas others display rather evenly 
distributed functional responses. 



of measuring the homogeneity of the distribution is 
given by its kurtosis, defined as the fourth standardized 
moment by 



K 



&4 

fef 



(3) 



Kurtosis is the measure of the "peakedness" of a distri- 
bution. It quantifies the concentration of frequencies 
around the mean of the distribution. Higher kurtosis 
means that the variance is the result of infrequent, 
extreme deviations from the mean as opposed to fre- 
quent, modestly sized deviations resulting in low kurto- 
sis. In order to define a measure characterizing the 
degree of plasticity of a given motif in terms of its spe- 
cialization or its flexibility, we can consider two extreme 
cases, namely the most flexible graph Tf equally likely to 
implement any function q>j, and the most specialized 
graph T s implementing only one function (p lt In the first 
case we would have P(cpj = 1/6 and the kurtosis 
associated is K{Tfj = -3.33, whereas in the second case 
we would have P(T S ) = 1 and 0 otherwise, with kurtosis 
K(r s ) = 6. Details on the calculation can be found in 
Methods. Any other FFL graph I\ from the set 
S = {Ci, C2, Cs, C4, l\, \<i, I3, 14} has kurtosis values locat- 
ing within this interval, K(Ti) e (K(Tfi, K(T S )). 

In order to measure the degree of plasticity in the 
decision-making between these extreme cases we intro- 
duce ^(r,). y/{Td is the distance between the absolute 
value of the kurtosis K(Ti) and the origin K 0 [26] or in 
other words represents the intermediate level between 
specialization and flexibility. This transformation opens 
the way for a more intuitive biological interpretation: 
The values for y/(Ti) range between high plasticity (low 
y/(Ti)) and high commitment towards one of the 
extreme strategies, i.e. maximal specialization or maxi- 
mal flexibility (high y/(Ti)). The optimal solution here is 



likely to be strongly impacted by the predictability of 
the environment. As a first approximation we therefor 
place K 0 at the midpoint of the interval {Kf,K s ), i.e. 



if{r i ) = \K[r i )\-K 0 



(4) 



Finally we define the likelihood p(T/) of a given motif 
to appear within a network as a function of its ^(r^). 
Assuming that a high degree of flexibility or specializa- 
tion should be related with lower likelihood of appear- 
ance for a given motif T if we write 



a 



(5) 



where a is a normalization coefficient defined as 

« = Ejli wr 1 . 

This function actually defines the expected probability 
of finding a given sub-graph and is thus mapping 
between the distributions associated to each motif and 
the expected abundance of motifs within networks. Fig- 
ures 3a and 3b show the correlation between relative 
abundances of FFL motifs in E. coli and S. cerevisiae 
with respect to their expected probabilities p(Ti). The 
matching is striking (data concerning abundances 
obtained from [27]). The two most abundant graphs (CI 
and II) are consistent with our results and the actual 
distribution matches well the observed pattern. 

Interestingly, the expected probabilities indicate a 
positive bias toward systems which show high plasticity 
as presented in figure 4. Intermediate values for kurtosis 
(figure 4a) or low ^(I\) (figure 4c) correlate with an 
increase in the likelihood of appearance. The values for 
kurtosis, y/(Ti) and p(r,) are collected in table 1. As an 
alternative measure for the homogeneity of a probability 
distribution the Shannon entropy is discussed in the 
Methods section, where the motif s entropy is used to 
characterize the degree of flexibility or specialization of 
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Figure 3 Predicted probability and FFL abundance. In (a) we 

compare the natural abundance and its predicted counterpart p(T,-). 
5. cerevisioe (black box) is compared to £ coli (white box) and the 
predicted probabilities (black triangle). In (b) we present the 
correlation between p(r ; ) and the natural abundances. The Pearson 
coefficient for the linear fit is r = 0.91 and r = 0.94 for £ coli and S. 
cerevisioe, respectively. 
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Figure 4 FFL plasticity and abundance. Here we compare the 
measures kurtosis, entropy and \|/(r,) and their correlation with the 
abundance of NW motifs (S. cerevisioe shown in black and E.coli in 
white). In (a) the kurtosis of the motifs probability distribution for 
different functions versus the motifs abundance is plotted. In (b) we 
show entropy versus abundance. The most abundant motifs have 
intermediate values for both measures which can be interpreted as 
high plasticity in both cases. In (c) we present the correlation 
between (//(r,) and abundance. y/{yl) correlates negatively and thus 
again, plasticity correlates positively with motif abundance. 



a given motif I\ [26] . Both, entropy (figure 4b) and kur- 
tosis (figure 4a) yield similar results, ranking the most 
abundant motifs with intermediate values which trans- 
lates to high functional plasticity. 

For the less abundant motifs we see a more disordered 
trend in the two measures, as is the case for C3, C4, 13 
(both measures) or C2 (entropy). The interpretation 
here is not straightforward. It is feasible that the disor- 
dered trends can be consequence of non-adaptive pro- 
cesses. An alternative hypothesis is related to the shape 
of the real distributions for the implementation of any 
function. We assume that for more and less frequent 
motifs the analytically deduced probability distributions 
does not fit equally well the real counterpart. The more 
abundant the motif, the better the underlying probability 
distribution is mirrored in its abundance, because the 
sampling space is covered more readily. 

Our analysis of FFLs dynamics was performed consid- 
ering single, isolated motifs. However, in real systems 
motifs are embedded in large networks allowing for the 
combination of motifs. The combination of more abun- 
dant motifs, such as CI and II, can cover the whole set 
of possible dynamics by that affecting the abundance of 
the rest of the motifs. 



Table 1 Kurtosis, y/(Tj) and predicted probability p(r,) 
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Table 1 shows the values for the motifs' kurtosis, ^(r,) and predicted 
probability p(JTi). The entries are calculated from the probability distributions 
according to equations 3,4 and 7. 



Evolvability 

In order to have a relevant role in evolvability, the degree 
of plasticity of FFLs should correlate with the motif s capa- 
city of generating phenotypic variation by exploring differ- 
ent functions under mutation. Two key aspects are of 
importance here, namely i) the reduction of mutational 
lethality and ii) the up-speeding of adaptational processes 
(reduction of the number of mutations needed to generate 
new phenotypes [1]). The evolvability of the circuits T k 
can be studied by calculating the transition probabilities 
co™ of shifting from function q> t to (pj under m muta- 
tions. The matrix Q m {k) = {co™) defines a flow graph (fig- 
ure 5b) which allows us quantifying its evolvability s{Ti\ 
We compared the robustness against single mutations ver- 
sus sequential accumulation of multiple mutations. Muta- 
tions m are defined as single parameter changes. For 
better understanding of the procedure, we want to stress 
the conceptual difference of continuous real mutations 
and the here applied parameter changes m which result in 
a discretized observation pattern. In the presented 
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Figure 5 Computation of the transition frequencies Sketch of 
the procedure. In (a) the update rule is shown. For a given string of 
conditional relations (BR) the associated dynamical pattern is 
calculated at time-step t - 1. Next the entries of BR are mutated at 
time-step f and the (new) dynamical pattern is evaluated. Then the 
transition of pattern M to pattern f is binned. This protocol is 
executed until no more changes in the bins occur as shown in plot 
(c). In (b) the graph Q m (T,-) associated to the transitions between 
the possible types of dynamics is represented for C1. The thickness 
of the arrows correspond to transition probabilities obtained from 
procedure (a). From these graphs (see Methods) we can calculate 
the motif evolvability s, which is found to positively correlate with p 
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framework a mutation m can be both, numerically small 
or large without impact as long as it does not drive the 
system into another functional regime. 

Robustness is defined as the sum of the diagonal ele- 
ments of H m (/c), L e. ^2iCO^{<pi\<pi). The diagonal elements 
<^(0il0i) give the probabilities of performing the same 
function after m mutations. A detailed description of the 
procedure is given in Methods. We have found that FFL 
motifs are very robust but some of them exhibit a high 
phenotypic variation under repeated mutations. The most 
abundant motifs, CI and II, show the highest phenotypic 
variation. In other words, CI and II can widely change 
their function with greater ease than the rest of the circuits 
facilitated by their low y/(Ti) (figure 6b). A network dis- 
playing little phenotypic diversity would give small values 
of s whereas sub-graphs with high transition rates among 
states will have a high s. As presented in figure 6a, £ corre- 
lates positively with the abundance of motifs, with CI and 
II displaying the largest values. These results suggest that 
a proper degree of plasticity, in terms of a balance between 
flexibility and specialization, is the optimal strategy to 
increase evolvability providing the playground for adaptive 
responses without increasing mutational lethality. 

Assuming that motif plasticity is a relevant trait, our 
analysis supports the idea that the observed FFL abun- 
dance pattern actually correlates with motif evolvability. 

Our analysis suggests that neither a direct interpreta- 
tion of motifs as functional modules [1,2,4] nor a purely 
non-adaptive view of their abundance [22-24] account 
for the uneven presence in transcription networks. Con- 
sistently with previous works [28,29] duplication-rewir- 
ing dynamics alone cannot explain the evolution of 
FFLs. The potential for evolvability associated to their 
topological structure might well be the missing ingredi- 
ent connecting both views. 



b 0,4E 



-trr- 



»X1 



Figure 6 FFL evolvability. In plot (a) we show the correlation 
between the motifs evolvability and its abundance. Evolvability s, 
which is found to positively correlate with p(T/), is highest for the 
most abundant motifs C1, /1. In black we show data point of 5. 
cerevisiae, in white E. coli. In (b) we show the correlation between 
the FFL's evolvability and its y/{Ti). We calculate a Pearson coefficient 
of r = -0.92 for the linear fit. The lower y/(Ti), the higher the motifs 
plasticity and the higher its evolvability. The data points are 
developed from the motifs topology and thus are species 
independent (blue). 



Conclusions 

In this article we have interpreted a simplified, qualita- 
tive model of the FFL motif. The thorough analysis 
within the model framework allows to reconstruct its 
natural abundance pattern and provides insight in what 
might have shaped it. The argument leads to the very 
core of the genotype-phenotype mapping problem, since, 
due to its simplicity, a perfect mapping between the 
topology and all possible functions it can implement can 
be constituted. We claim, however, general applicability. 
FFL abundances are correlated with their plasticity and 
evolvability. Evolvability has been defined as a compro- 
mise between robustness against single mutations and 
the capability to modify the functional response upon 
increasing mutational pressure. The results indicate that 
a proper portion of intrinsic functional plasticity, which 
can be understood as a strategic trade-off between spe- 
cialization and flexibility, is necessary to be abundant. 
Because only then one is suited to be readily evolvable 
in changing environments. 

Future work should be devoted to analyzing how the 
coexistence of different motifs embedded in a large net- 
work affects their dynamics and abundance compared to 
the single motif analysis performed in this work. 

Methods 

Dynamical response of FFL motifs 

Network motifs are recurrent interaction patterns, which 
are significantly more often encountered in biological 
interaction graphs than expected from random nets. It has 
been shown that feed forward loops (FFL) are capable of 
processing external signals by responding in a very speci- 
fic, robust manner, either accelerating or delaying 
responses. They are composed of three genes. Firstly, gene 
G x that expresses the protein X. This protein X regulates 
the expression of the other two genes G Y and G z encoding 
the proteins Y and Z, respectively. Additionally, protein Y 
regulates the expression of Z (see figure 1). Here, we 
assume that expression of X is unregulated and the protein 
is expressed in its inactive form, i.e. X does not regulate 
the expressions of Y and Z straightaway. Only upon the 
presence of an external signal (the input) X becomes active 
and regulation of Y and Z takes place, where Z resembles 
the output of the motif. The dynamics describing how the 
concentration of Z changes during time from the initial 
state (without input) to the final state (with input) are cal- 
culated from a set of two differential equations [26]: 



Y=y Y 



1 + a x co x y X n 
1 + co x X n 



Yz 



1 + p x co x z X n + pyoo{Y m + ^ojfX n Y n 



1 + co x X n + co y z Y m + 0)7 X n Y 



■SyY 

(6) 
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Here y t describes the basal production of protein /, 
with i = {Y, Z}, subsuming the concentration of all bio- 
chemical elements which remain constant in time. The 
binding equilibrium of the regulators j with the gene G t 
are denoted by oJ., with ; = {X, Y, Z}. Parameters a* and 
P define the type of regulatory interactions, i.e. activa- 
tion or inhibition, for gene G Y and G z , respectively, pro- 
viding the regulatory rates with respect to the basal 
transcription. Values < 1 correspond to inhibitory regu- 
lation, whereas > 1 accounts for activation (denoted by 

and V in figure la respectively). The parameter fi xy 
accounts for the simultaneous regulation of G z . The 
degradation rate of protein i is denoted as Finally, n 
and m are the degree of multimerization of the 
regulators. 

If we consider the system in phase space, we find that 
in absence of input the system resides in a stable steady 
state determined by the crossing of the nullclines, Y = 0 
and Z = 0, respectively. Upon external input, X is acti- 
vated and hence the shapes of the nullclines change. 
They provide a new crossing and consequently a new 
steady state. Due to these changes in the nullclines' geo- 
metry the system must evolve from the initial state 
towards the new stable state. The evolution corresponds 
to a trajectory crossing phase space that depends on i) 
the location of the initial state, ii) the location of the 
final state, iii) the new shapes of the nullclines upon 
input. The specific dynamics implemented by a given 
motif is determined by this trajectory, which depends on 
the set of parameters. However, by analyzing the geome- 
trical features of the nullclines it is possible to deter- 
mine the so-called Backbone of Requirements for the 
FFL response (BR), i.e. a set of qualitative relationships 
between different geometrical features of the nullclines 
and the location of the initial and the final point that 
univocally determines the dynamics [26]. Therefore, for 
a given FFL motif all different sets of parameters satisfy- 
ing the same BR implement the same function. Simi- 
larly, also different BRs may implement the same 
function. 

Based on the analysis of the different BRs associated 
with a given motif and their impact on the motif s func- 
tion, we are in the position to determine a distribution 
of probabilities for the implementation of any function 
(see [26] for details about quantification of the dynami- 
cal probabilities). Table 2 shows the conditional prob- 
ability that each motif T t (rows) implements the 
function q)j (columns). 

Functional robustness and mutational perturbation 

Parametric mutations have different impact on the 
motif s function, as in nature they can either be neutral 
or causing qualitative changes. For the system we 
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Table 2 shows the conditional probability for each motif r, (rows) to 
implement the function <pj (columns).These probability distributions are 
calculated from the association of a given motif, its possible BRs and the 
respective functions, as described in [26]. 



present here, only those mutations cause functional 
change, which induce a qualitative alteration in the 
shape of the nullclines, represented in the Backbone of 
requirements for the FFL response (BR) [26]. However, 
the mutation will become visible only, if the resulting 
BR is actually associated with a different function. 

To estimate and compare the degree of mutational 
robustness for the different FFL motifs, we carried out a 
numerical study calculating the frequency of functional 
shifts upon parametric perturbation of equation (6) as 
shown in figure 5. For a given motif, this can be done 
introducing random mutations in the parameters that 
define characteristically the dynamics of FFL motif (fig- 
ure 5a). We restrict the analysis to that sort of muta- 
tions that does not change the topology of the FFL, i.e. 
mutations that do not change the qualitative type of reg- 
ulations (activation or inhibition) described by a x , /3 X 
and /P. 

The suffered mutations are reflected (or not) in a qua- 
litative change of the BR. Here different scenarios are 
possible, i) mutations that do not affect qualitatively the 
nullclines' geometry, i.e. there are no changes in the BR 
(neutral), ii) mutations that are reflected in compatible 
changes in the BR, but the new BR is associated to the 
same dynamic than the previous one (neutral), and 
finally iii) mutations that are reflected in compatible 
changes in the BR, and the new BR is associated to a 
different dynamic (qualitatively changing mutations). 

In our numerical study we have considered 1000 dif- 
ferent sets of parameters for each FFL type (8000 cir- 
cuits in total). For each FFL, the mutational process is 
repeated 10.000 times until the probabilities of func- 
tional shifts stabilize (figure 5c) and the effects of the 
accumulation of mutations can be analyzed. This evalua- 
tion of the transition probabilities does not depend on 
specific parameter values but on the conditional rela- 
tions between them. Since the relations are of the sort a 
> 6, changes in the function may be achieved by very 
small or large parameter changes equally likely, 
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depending only on the conditional dependencies 
between the key-parameters and the corresponding 
values at time-step t - 1. 

Tables 3 and 4 show the probabilities of transition 
between the different dynamics subsumed in a transition 
matrix. These matrices define a transition graph for 
each FFL motif shown in figure 5b. 

The elements of the diagonal correspond to the func- 
tionally invariant mutations, i.e. changes in the BR with- 
out changes in the dynamics. Rows represent the initial, 
columns the final dynamics. In tables 3 and 4 we sum- 
marize the effects of single mutations, whereas in table 
5 and 6 accumulated mutations are presented, i.e. multi- 
ple conditions of the BR changed in a successive 
manner. 

The sum of the diagonal elements determines the 
fraction of the mutations without impact on the 
dynamics. It provides a measure of the robustness 



Table 3 Transition probabilities for single mutations C1- 
C4 



Table 4 Transition probabilities for single mutations 11-14 
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Table 3,4,5 and 6 list the transition probabilities between functions for every 
motif and its respective function probability distribution. The values are 
obtained from numerical simulation. 
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against perturbations. As our data show, all FFLs are 
highly robust against single mutations, they show similar 
values above 90%. In other words, upon single muta- 
tions the FFLs display low sensitivity against mutation, 
hence a low evolvability s. However, there occurs a sig- 
nificant change in the degree of evolvability if the effect 
of accumulated mutations is studied. We quantify the 
evolvability e{Ti) of a given topology as: 



£ (r fc ) = i 



(7) 



where Ei ^(0x100 represents the robustness against 
accumulated and J2i ^fe(0il0i) re P resen ts the robustness 
against single mutations. Since 

Ei «fe(0.#i) > Ei n?(0.#i)> we have 0 < S(T k ) < 1. 

Entropy, kurtosis and the likelihood of appearance of FFL 
motifs 

The degree of homogeneity of the distribution of prob- 
abilities for the implementation of any function can be 
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Table 5 Transition probabilities for accumulated 
mutations C1-C4 
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Table 6 Transition probabilities for accumulated 
mutations 11-14 
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used to characterize the level of flexibility or specializa- 
tion or, complementarily, the plasticity of a given motif 
IV Different measures can be used to quantify the 
homogeneity of a distribution, namely kurtosis (see 
Results and Discussion) or entropy. Here we point out 
some more details on the calculation of the extreme 
cases and their kurtosis and how the Shannon entropy 
can be applied equally effective for this task. The 
expression for kurtosis can be written [30] as: 



K{r t ) - 



n(n+ 1) 



(n-l)(n-2)C 



P{<Pj\Ti)-<P>\ 



3(n-l) 2 
(n-2)(n-3) 



(8) 



where <P > is the mean value of the probability distri- 
bution and O" is the standard deviation. 

Considering the extreme case where the most specia- 
lized motif T s only implements a single function (p lt i.e. 
P((pj | T s ) = 1 if / = 1 and 0 otherwise, expression (8) 
reduces to: 



^•n(n+l)(n-l)-3(n-l) 2 
(n - 2)(n - 3) 



(9) 



On the other hand, the most flexible motif Tf imple- 
ments all possible functions with equal probability P((pj | 
Tf) = 1/n. Here expression (8) leads to a mathematical 

indetermination -, because P((pj | Tj) = <P >. This inde- 



0 



termination can be solved calculating the limit 

lim K(r f ). 

p( ( p j \r f )^<p> 



(10) 



Applying L'Hopital's rule [31], expression (10) reduces 



to: 



K{Tf) 



[n+ l)(n- 1) -3(n- l) 2 
(n - 2)(n - 3) 



(11) 
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The calculated kurtosis values are K(T S ) = 6 and K(Tfj 
= -3.33, knowing that n = 6 (number of different possi- 
ble functions). 

Finally, we apply the Shannon entropy [32] to our data 
set to describe the qualitative differences in homogene- 
ity of the motifs' probability distributions and develop a 
measure for functional specialization. It is defined as 

6 

H(ro = -j^nmiogiipwm (12) 

Again, we will first consider the extreme cases T s and 
Tf to determine the range of all possible values. We find 
for the most flexible case P((pj \ TJ) = 1/6 an entropy of 
H(Tf) = log 2 (6). For the most specialized case T s (P((p\ \ 
T s ) = 1 and 0 otherwise) the associated entropy is H(T S ) 
= 0. Any FFL motif will have entropy values residing 
within this range. When correlating the FFLs' entropy 
and their abundance in figure 4b we find that the most 
abundant motifs show intermediate values. This trend 
coincides with what has been found when applying kur- 
tosis, where too, CI and II show intermediate kurtosis 
values. They do neither exhibit high flexibility nor high 
specialization, indicating that a trade-off between both 
features is associated to the most abundant motifs. It 
can be understood in terms of adaptability or in other 
words the motifs' plasticity to explore the landscape of 
possible dynamics under mutational pressure. 

Acknowledgements 

We thank the members of the Complex Systems Lab for fruitful discussion. 
This work was supported by the EU grant CELLCOMPUT, the EU 6th 
Framework project SYNLET (NEST 043312), the James McDonnell Foundation, 
the Marcelino Both Foundation, the University of Vienna and by the Santa 
Fe Institute. 

Author details 

department for Computational Systems Biology, University of Vienna, 
Althanstr. 14, A-1090 Vienna, Austria. 2 Complex Systems Lab (ICREA-UPF), 
Barcelona Biomedical Research Park (PRBB-GRIB), Dr. Aiguader 88, 08003 
Barcelona, Spain. 3 Santa Fe Institute, 1399 Hyde Park Road, Santa Fe 
NM87501, USA. 

Authors' contributions 

SW: study conception, research design, mathematical analysis, manuscript 
writing. RS: study conception, research design, manuscript writing. JM: study 
conception, research design, mathematical analysis, manuscript writing. All 
authors have read and approved the final version of the manuscript 

Received: 11 October 2011 Accepted: 19 January 2012 
Published: 19 January 2012 

References 

1 . Kirschner M, Gerhart J: Evolvability. Proc Natl Acad Sci USA 1 998, 95:8420-7. 

2. Wagner A: Robustness and evolvability: a paradox resolved. Proc R Soc 
London B 2008, 275:91-100. 

3. Ancel LW, Fontana W: Plasticity, evolvability, and modularity in RNA. J Exp 
Zool 2000, 288:242-83. 

4. Stadler B, Stadler P, Wagner G, Fontana W: The topology of the possible: 
formal spaces underlying patterns of evolutionary change. J theor Biol 
2001, 213:241-274. 



5. Alon U: Network motifs: theory and experimental approaches. Nature 
Reviews Genetics 2007, 8:450-61. 

6. Milo R, Shen-Orr S, Itzkovitz S, Kashtan N, Chklovskii D, Alon U: Network 
motifs: simple building blocks of complex networks. Science 2002, 
298:824-7. 

7. Shen-Orr SS, Milo R, Mangan S, Alon U: Network motifs in the 
transcriptional regulation network of Escherichia coli. Nat Genet 2002, 
31:64-8. 

8. Mangan S, Zaslaver A, Alon U: The Coherent Feedforward Loop Serves as 
a Sign-sensitive Delay Element in Transcription Networks. J Mol Biol 2003, 
334:197-204. 

9. Wall M, Dunlop MJ, Hlavacek W: Multiple Functions of a Feed-Forward- 
Loop Gene Circuit. MB 2005, 349:501-14. 

10. Ozbudak EM, Thattai M, Lim HN, Shraiman Bl, Oudenaarden AV: 
Multistability in the lactose utilization network of Escherichia coli. Nature 

2004, 427:737-40. 

11. Alon U, Surette MG, Barkai N, Leibler S: Robustness in bacterial 
chemotaxis. Nature 1999, 397:168-71. 

12. Barkai N, Leibler S: Robustness in simple biochemical networks. Nature 
1997, 387:855-7. 

13. Ishihara S, Fujimoto K, Shibata T: Cross talking of network motifs in gene 
regulation that generates temporal pulses and spatial stripes. Genes Cells 

2005, 10:1025-1038. 

14. Cotterell J, Sharpe J: An atlas of gene regulatory networks reveals 
multiple three-gene mechanisms for interpreting morphogen gradients. 
Mol Sys Biol 2010, 6:425-438. 

15. Mangan S, Alon U: Structure and function of the feed-forward loop 
network motif. Proc Natl Acad Sci USA 2003, 100:1 1980-5. 

16. Kalir S, Mangan S: A coherent feed-forward loop with a SUM input 
function prolongs flagella expression in Escherichia coli. Mol Syst Biol 
2005,1,2005.0006. 

17. Draghi J, Wagner GP: Evolution of evolvability in a developmental model. 
Evolution 2008, 62:301-15. 

18. Prill RJ, Iglesias PA, Levchenko A: Dynamic properties of network motifs 
contribute to biological network organization. PLoS Biol 2005, 3:e343. 

19. Ghosh B, Karmakar R, Bose I: Noise characteristics of feed forward loops. 
Phys Biol 2005, 2:36-45. 

20. Cordero OX, Hogeweg P: Feed-forward loop circuits as a side effect of 
genome evolution. Mol Biol Evol 2006, 23:1931-6. 

21. Ingram PJ, Stumpf MP, Stark J: Network motifs: structure does not 
determine function. BMC Genomics 2006, 7:108. 

22. Lynch M: The evolution of genetic networks by nonadaptive processes. 
Nat Rev Genet 2007, 8:803-813. 

23. Sole RV, Valverde S: Are network motifs the spandrels of cellular 
complexity? Trends Ecol Evol 2006, 21:419-22. 

24. Sole RV, Valverde S: Spontaneous emergence of modularity in cellular 
networks. J R Soc Interface 2008, 5:129-33. 

25. Mazurie A, Bottani S, Vergassola M: An evolutionary and functional 
assessment of regulatory network motifs. Genome Biol 2005, 6:R35. 

26. Macia J, Widder S, Sole RV: Specialized or flexible feed-forward loop 
motifs: a question of topology. BMC Syst Biol 2009, 31:3-84. 

27. Mangan S, Itzkovitz S, Zaslaver A, Alon U: The Incoherent Feed-forward 
Loop Accelerates the Response-time of the gal System of Escherichia 
coli. J Mol Biol 2006, 356:1073-81. 

28. Conant GC, Wagner A: Convergent evolution of gene circuits. Nat Genet 
2003, 34:264-266. 

29. Teichmann SA, Babu MM: Gene regulatory network growth by 
duplication. Nat Genet 2004, 36:492-496. 

30. Balanda KP, MacGillivray HL: Kurtosis: a critical review. Am Statistn 1988, 
42:111-9. 

31. Spivak M: Calculus. Cambridge University Press; 1994, 201-11. 

32. Shannon CE: A Mathematical Theory of Communication. Bell System 
Technical Journal 1948, 27:379-423. 



doi:1 0.1 186/1 752-0509-6-7 

Cite this article as: Widder et ah Evolvability of feed-forward loop 
architecture biases its abundance in transcription networks. BMC 

Systems Biology 2012 6:7. 



